49 research outputs found

    A Simulation Suite for Lattice-Boltzmann based Real-Time CFD Applications Exploiting Multi-Level Parallelism on Modern Multi- and Many-Core Architectures

    Get PDF
    We present a software approach to hardware-oriented numerics which builds upon an augmented, previously published open-source set of libraries facilitating portable code development and optimisation on a wide range of modern computer architectures. In order to maximise eficiency, we exploit all levels of arallelism, including vectorisation within CPU cores, the Cell BE and GPUs, shared memory thread-level parallelism between cores, and parallelism between heterogeneous distributed memory resources in clusters. To evaluate and validate our approach, we implement a collection of modular building blocks for the easy and fast assembly and development of CFD applications based on the shallow water equations: We combine the Lattice-Boltzmann method with i-uid-structure interaction techniques in order to achieve real-time simulations targeting interactive virtual environments. Our results demonstrate that recent multi-core CPUs outperform the Cell BE, while GPUs are significantly faster than conventional multi-threaded SSE code. In addition, we verify good scalability properties of our application on small clusters

    A Simulation Suite for Lattice-Boltzmann based Real-Time CFD Applications Exploiting Multi-Level Parallelism on Modern Multi- and Many-Core Architectures

    Get PDF
    We present a software approach to hardware-oriented numerics which builds upon an augmented, previously published open-source set of libraries facilitating portable code development and optimisation on a wide range of modern computer architectures. In order to maximise eficiency, we exploit all levels of arallelism, including vectorisation within CPU cores, the Cell BE and GPUs, shared memory thread-level parallelism between cores, and parallelism between heterogeneous distributed memory resources in clusters. To evaluate and validate our approach, we implement a collection of modular building blocks for the easy and fast assembly and development of CFD applications based on the shallow water equations: We combine the Lattice-Boltzmann method with i-uid-structure interaction techniques in order to achieve real-time simulations targeting interactive virtual environments. Our results demonstrate that recent multi-core CPUs outperform the Cell BE, while GPUs are significantly faster than conventional multi-threaded SSE code. In addition, we verify good scalability properties of our application on small clusters

    HONEI: A collection of libraries for numerical computations targeting multiple processor architectures.

    Get PDF
    We present HONEI, an open-source collection of libraries offering a hardware oriented approach to numerical calculations. HONEI abstracts the hardware, and applications written on top of HONEI can be executed on a wide range of computer architectures such as CPUs, GPUs and the Cell processor. We demonstrate the flexibility and performance of our approach with two test applications, a Finite Element multigrid solver for the Poisson problem and a robust and fast simulation of shallow water waves. By linking against HONEI's libraries, we achieve a two-fold speedup over straight forward C++ code using HONEI's SSE backend, and additional 3--4 and 4--16 times faster execution on the Cell and a GPU. A second important aspect of our approach is that the full performance capabilities of the hardware under consideration can be exploited by adding optimised application-specific operations to the HONEI libraries. HONEI provides all necessary infrastructure for development and evaluation of such kernels, significantly simplifying their development

    Partitioned Coupling vs. Monolithic Block-Preconditioning Approaches for Solving Stokes-Darcy Systems

    Get PDF
    We consider the time-dependent Stokes-Darcy problem as a model case for the challenges involved in solving coupled systems. Keeping the model, its discretization, and the underlying numerics for the subproblems in the free-flow domain and the porous medium constant, we focus on different solver approaches for the coupled problem. We compare a partitioned coupling approach using the coupling library preCICE with a monolithic block-preconditioned one that is tailored to different formulations of the problem. Both approaches enable the reuse of already available iterative solvers and preconditioners, in our case, from theDuMuxframework. Our results indicate that the approaches can yield performance and scalability improvements compared to using direct solvers: Partitioned coupling is able to solve large problems faster if iterative solvers with suitable preconditioners are applied for the subproblems. The monolithic approach shows even stronger requirements on preconditioning, as standard simple solvers fail to converge. Our monolithic block preconditioning yields the fastest runtimes for large systems, but they vary strongly with the preconditioner configuration. Interestingly, using a specialized Uzawa preconditioner for the Stokes subsystem leads to overall increased runtimes compared to block preconditioners utilizing a more general algebraic multigrid. This highlights that optimizing for the non-coupled cases does not always pay off

    Irradiation-dependent topology optimization of metallization grid patterns and variation of contact layer thickness used for latitude-based yield gain of thin-film solar modules

    Get PDF
    We show that the concept of topology optimization for metallization grid patterns of thin-film solar devices can be applied to monolithically integrated solar cells. Different irradiation intensities favor different topological grid designs as well as a different thickness of the transparent conductive oxide (TCO) layer. For standard laboratory efficiency determination, an irradiation power of 1000W/m2^{2} is generally applied. However, this power rarely occurs for real-world solar modules operating at mid-latitude locations. Therefore, contact layer thicknesses and also lateral grid patterns should be optimized for lower irradiation intensities. This results in material production savings for the grid and TCO layer of up to 50 % and simultaneously a significant gain in yield of over 1% for regions with a low annual mean irradiation

    Ontologies for Models and Algorithms in Applied Mathematics and Related Disciplines

    Full text link
    In applied mathematics and related disciplines, the modeling-simulation-optimization workflow is a prominent scheme, with mathematical models and numerical algorithms playing a crucial role. For these types of mathematical research data, the Mathematical Research Data Initiative has developed, merged and implemented ontologies and knowledge graphs. This contributes to making mathematical research data FAIR by introducing semantic technology and documenting the mathematical foundations accordingly. Using the concrete example of microfracture analysis of porous media, it is shown how the knowledge of the underlying mathematical model and the corresponding numerical algorithms for its solution can be represented by the ontologies.Comment: Preprint of a Conference Paper to appear in the Proceeding of the 17th International Conference on Metadata and Semantics Researc
    corecore